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Abstract 

Consider a sequence (indexed by n) of Markov chains z '"'(•) in R d characterized by transition kernels 
that approximately (in n) depend only on the rescaled state rC^z ^ n '(r). Subject to a smoothness condition, 
such a family can be closely coupled on short time intervals to a Brownian motion with quadratic drift. This 
construction is used to determine the first two terms in the asymptotic (in n) expansion of the probability 
that the rescaled chain exits a convex polytope. The constant term and the first correction of size Bin- 1 ' 6 ) 
admit sharp characterization by solutions to associated differential equations and an absolute constant. The 
error is smaller than 0(n~ v ) for any r\ < 1/4. 

These results are directly applied to the analysis of randomized algorithms at phase transitions. In 
particular, the 'peeling' algorithm in large random hypergraphs, or equivalently the iterative decoding scheme 
for low-density parity-check codes over the binary erasure channel is studied to determine the finite size 
scaling behavior for irregular hypergraph ensembles. 

1 Introduction 

We deal with a sequence of Markov chains z^ n \-) taking values in a countable set S C R d , started at an initial 
distribution z(0), and with transition probabilities given by a sequence of kernels 

W n (A\z^(r)) = ¥(z^(t+ 1) - ?W( T ) = A|f( n )(r)) (1.1) 

satisfying W n (A|z) ss W(A\n^ 1 z), where W has compact support in the first coordinate and is smooth in the 
second coordinate. Markov chains in this class are in a sense 'slowly- varying': they make jumps of size 6(1), but 
the kernel depends only on transitions on the scale of 0(n). The study of pure-jump Markov processes with an 
analogous property (transition intensities of the form n/?(n -1 -)) goes back at least to [Ku70]. They have since 
been widely applied to population processes in epidemics, queueing, or networks (cf. [Ku81], [AB00], [Wh02] 
and references therein), as well as models for chemical reactions (cf. [BKPR06] for a recent contribution). 
These applications motivate the name density dependent, since n~ x z (") can be thought of as the density of a 
population of z ^ individuals living in an area of size n. Another application, particularly relevant here, is the 
analysis of randomized algorithms arising in (probabilistic) combinatorics and optimization (cf. [Wor95] and 
others). 

Letting z(0) = nyo for a non-random (for now) yo, it is known that the rescaled, interpolated process 
n~^z^ n \n ■) on compact time intervals [0,0] concentrates near (its fluid limit) the solution to the ordinary 
differential equation 

%e) = F(y(9)): m)=Po, (1-2) 
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Figure 1: Left panel: The interesting case of the vector field F tangent to the boundary <9D. Two realizations 
of the chain rT x z •) (dotted lines) remain within distance 0(y/ (log n)/n) of the deterministic evolution 
y (solid parabola); one realization exits the domain, while the other remains in the interior. Right panel: 
Example setting in R 2 . Here D is a rectangle and the critical trajectory started at yo touches the boundary of 
D at N = 3 critical times Q\, 9 2 c , and d\ 



where F(x) = A VK(A|a?) . It is a consequence of the results on convergence of Markov semigroups that 
the scaled fluctuation of the process about this solution, y/n \n~' 1 z(n ■) — y(-)}, converges weakly to a diffusion. 
Both results are special cases in the rich literature on convergence of Markov processes: cf. [EK86, Kal02] for 
general theory, [Wh02] as well as work of M. Bramson, M. Harrison, and R. Williams for fluid and diffusive 
limits in qucucing theory, and [DN08] for a recent survey of fluid limits with quantifiable error probabilities (in 
the spirit of Lemma 2.4 here). 

In view of these results, one can think of the rescaled process as a random perturbation of the vector 
flow F and the question arises of how randomness changes the dynamics. Systems with Gaussian or Markov 
perturbation are a central theme of [FW84] (where among other results large deviation estimates for the place of 
exit from a domain D are derived) and research following from there, of which the most relevant to the problem 
to be defined here is the case of characteristic boundaries (cf. [Day89] and others). Asymptotic expansions of 
probabilities for chains with two time-scales, i.e. a random perturbation of a Markov chain have been surveyed 
in [YZ05]. There are also extensions of fluid limits with error estimates, cf. [Tur07] for a recent contribution. In 
contradistinction to these results, we are interested in the fine asymptotics of the probability that the rescaled 
Markov chain exits a domain D before some fixed time 9 > and when this probability does not tend to either 
or 1 , which happens when the vector field F is tangent to the boundary. This is apparent from the left panel 
of Figure 1 , where the bound coming from the fluid limit estimate does not prevent the chain from exiting the 
domain, as it would should the deterministic evolution y remain in the interior of D. 

To investigate this probability as a function of the initial condition, we introduce the following construction. 
Let p <— > y p be a smooth parametrization (from (—5, S) into D) of the initial condition, such that the solution of 
(1.2) with y(0) = 2/0 remains in the interior D° of D except at finitely many critical times 6 % c , i < N, when the 
trajectory is tangent to the boundary. (See Figure 1 for an example and Section 1.1 for a rigorous statement.) 
It follows from the error bounds on the fluid limit available in our setting, that the probability P oxit (n, p) of the 
exit from D by chain n _1 z <<n \-) started at n~ x z (™)(0) = y p before time n = \n0\ equals or 1 (up to 0(e~ c ™)), 
depending on whether the trajectory y(-) started at y p remains in D° or exits D, respectively. (See also [DN08, 
Section 4.3] for a basic result on the place of exit.) Thus we have a phase transition for parameter p and as a 
consequence of the diffusive limit result (assuming the chain is non-degenerate) the size of the scaling window 
of this transition is 0(n -1 / 2 ), meaning that P exit (n, vnT 1 / 2 ) = /i(r) + o(l) as n — > oo for some smooth function 
fi. The first intended contribution of this paper is to take the analysis further and determine the finite size 
scaling (FSS) behavior, i.e. explicitly characterize the first correction term f 2 and establish an error bound to 
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the effect of 

P exit (n,rn-^ 2 ) = h{r) + f 2 (r) n -^ 6 + 0{n x l^) (1.3) 

for any e > (see Section 1.1). 

The main result (1.3) has direct applications to the analysis of randomized algorithms in large combinatorial 
problems, where density dependent chains describing position in a state-space S naturally appear. One is 
interested in the probability of such an algorithm failing (or halting) , which corresponds to the exit of the chain 
from some domain, leading to a phase transition (failure vs. success) for the initial data. The expansion (1.3) 
allows one to obtain detailed information about the FSS in quite some generality from essentially the solution of 
three differential equations. Obtaining such detailed information has only been possible so far in special cases, 
for example the Erdos-Renyi random graph at the criticality (cf. [JKLP93], [ABG09] and references therein), 
even though cutoff phenomena like the one just described have been intensely studied. Cf. [Fri99] for existence 
of sharp thresholds for graph properties; [Wor95] for location of thresholds for randomized agorithms; [Wil02] 
for size of a phase transition window ('Harris criterion'); [Di96] and others for cutoffs for mixing times. 

The application described in the previous paragraph is illustrated here with a specific example. Section 5 
develops a framework, based around Proposition 5.2, for the study of the 'peeling' algorithm in large random 
hypergraphs. Within this framework the main result of this paper is applied to establish the FSS behavior in 
the phase transition for the existence of a 2-core in an ensemble of particular interest (see Section 1.2). This 
result has important consequences for information transmission over noisy channels, which are explained in 
Remark 1.4. Morevoer, other ensembles of importance can be now rather straightforwardly analyzed using this 
framework (see Remark 5.5). 

The result (1.3) was derived in [DM08] for a special case (a particular Markov chain), with D = Z + x Z and 
one critical time (N = 1) . In the general setting considered here new methods are needed to handle the difficulties 
coming from handling several critical times, exit through different faces of D, and the chain not being density 
dependent outside D°. The latter is not an artificial condition and occurs in applications, where the algorithm 
failure corresponds to the chain reaching a coffin state at the boundary of D, precluding extension of the kernel 
W n . This is dealt with here by introducing another Markov chain with kernel lF(-|x) = T / F(-|K(a;)) (extension 
of the smooth kernel W through projection K onto D) and comparing the vector field associated with W to the 
smooth extension (from D°) of the vector field associated with W (see Lemma 2.1). In turn the existence of 
several critical times prevents computation of P oxit by conditioning on not-exiting near previous critical times, 
as such conditioning changes the distribution of the process. Instead, multivariate Taylor expansion of standard 
normal distribution in K N is used and it is shown that the dependent terms corresponding to the influence of 
exit near previous critical times are small enough (see Proposition 4.3). This also leads to a certain simplification 
and a sharper error estimate in the asymptotic expansion as compared with [DM08]. 

The remainder of Section 1 is devoted to the rigorous statements of the main result (1.3) (in Section 1.1) 
and the FSS for the 2-core problem (in Section 1.2). 

1.1 Density dependent Markov chain exiting a polytope 

Let D = {x £ M, d : r(i ■ x > gi for i = 1, . . . , IM}, where rTj are unit vectors and gj £ R, be a fixed non-degenerate 
convex polytope in R d and let {W n } be a family of kernels defining transition probabilities on some countable 
set S as in (1.1) and satisfying the following two conditions: 

(a) increments are uniformly bounded, i.e. there exists a finite constant ki such that for all n, z £ S and 
Aetf with ||A|| > ki we have W n (A\z) = 0; 

(b) there exists a probability kernel VF(-|-) : R d x D — > [0,1], such that for each A £ R d the functions 
{x i — ► IF(A|x) : A £ M, d } are twice differcntiablc in D (including the boundary) with (uniformly in A) 
Lipschitz continuous derivatives and such that for some constants k 2 > and e < \ we have 

\\W n (-\z)-W(-\n- x z)\\^<k 2 n-^ (1.4) 

for all z £ S n (nD°), where nD° = {x£R d :n 1 x£ D°} and || • || TV denotes the total variation distance. 
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Let p i— > y p be a twice diffcrentiable map with bounded derivatives from (—6,6) into D°. Let y(-) be a twice 
differentiable (with Lipschitz continuous second derivative) solution of (1.2) on the interval 9 € [0, 9] with y(0) = 
y - Suppose there exists a finite subset 6 = {0],, . . . , 9^} C (0,9) such that d(y(9)) = min^rTj • y(9) — gi) > for 
all 9 e [0, 9]\Q, whereas for each 1 < i < N there exists a unique j(i) such that setting = n^) and di = gj^ 
we have 



m, • y(9 l c ) = d t 



and m, • 0(0*) > 0. 



(1.5) 



To allow for a random perturbation of the initial state y p , let {z n _ p : n, p} be a family of random variables taking 
values in S n (nD°) and such that for some positive constants k 3 , k4, k 5 and k 6 the following hold uniformly in 
n and p: 



P{||z„, p -Ez n , p ||>r} < k 3 e^ 2 /( k ^ 
\\Ez niP - ny p \\ < k 5 



sup sup 

veM N ,d se~R d 



P{U(z„, p - Ez n , p ) < x } - P{n 1/2 UC P < x} 



< k 6 n 



-1/2 



(1.6) 
(1.7) 
(1.8) 



where M. denotes the collection of real N x d matrices and ( p is a Gaussian vector in R d with zero mean 
and positive semi-definite (p.s.d.) covariance matrix Q p , such that p i— > Q p is a Lipschitz continuous map (for 
the operator norm). Note that, in particular, z n ^ p — ny p satisfies this with Q p = 0. 



For each n and \p\ < 6 let P np denote the law of the Markov chain z(-) started at z(0) 



and with 



transition probabilities given by W n . The following is the formal statement of the result announced as (1.3). 

Theorem 1.1. Let r € R, p n = rrT 1 ! 2 and assume that S, defined in (1.9), is non-singular. Then for any 
r] < 1/4 A (1/2 — e) we have 

P no ^ t (n, Pn ) = P„. p „ (mind(z(r)) > o} = $(ra) - b* V$(ra) SI n^ 6 + 0(n^), 

^ T<n J 

where <&(•) denotes the standard normal distribution function in ~R N , a = S^sT and b = S^?A /or i/ie N x N 
p.s.d. symmetric matrix £ and N x 1 vectors T and A wzi/i entries given by (1 <i < j < N) 



= mlQ(^)(B 9 j(^))t^. 



[T]i = 



ml Bo^) ^ 



• §(91) 



p=0 
1-1/3 



mlG(y(^))m, 



2/3 



(1.9) 
(1.10) 

(1.11) 



where G(x) is the covariance matrix ofW(-\x) defined in (2.1), and where B and Q satisfy the system (2.3-2.4) 
at p = 0. Finally, the finite constant ft is given by the integral 



/>oo 

n= / [i -K,(zf]dz, 

Jo 



where 



Ai(iy)Bi(2 1 / 3 z + iy) - Ai(2 1 / 3 z + iy)Bi(iy) 
Ai(iy) 



dy 



(1.12) 



(1.13) 



and Ai(-), Bi(-) are i/ie Airy functions (as defined in [AS64, p. 446])- 



Remark 1.2. Theorem 1.1 can be extended, as follows, to provide some information in the case when £ is 
singular, which can occur when the Markov chain is degenerate. Write £ = Odiag(Ai, . . . , Ajv) O*, where 
O = (vi,...,Vjv) is an orthogonal matrix with {vi} the eigenvectors of £ and {Xi} are the corresponding 



4 



eigenvalues of £ in decreasing order. Let k = mm{i : \i = 0} < N and define V = (vi, . . . , Vfc), the span 
of {vi, . . . , Vfe}. Setting V = (\/AiVi, . . . , \/^T v fc) and Q is a standard Gaussian vector in M, k , straightforward 
calculus shows that the function <&(•) = P(V£ < •) is smooth on M, N \V. It follows (by modifying the last part 
of the proof of Proposition 4-3 to allow for singular £ by projecting {gk + gfc} (there) onto V, and generalizing 
the estimate in Lemma 4-2), that for all r el such that rT ^ V 

P,„ e* it (n, Rn) = $(rT) - A f V$(rT) fin" 1 / 6 + Ofa""), 

w/iere 77 is like in Theorem 1.1. In particular, if (rT + H) n V = 0, w/iere H denotes the negative orthant of 
R N , then actually 

P< lo ^(n,p n ) = 0(n- r >). 

It is also straightforward to modify the proof to yield an expansion for the probability of exiting near a 
particular critical time, or equivalently through a particular face of D. Finally, the result extends to domains 
with smooth boundaries, though the proof then requires analysis in boundary layer coordinates near critical times. 
The choice of polytope domains here is motivated by applications and clarity of presentation. 



1.2 Finite size scaling for the 2-core in irregular hypergraph ensembles 

A 2-core in a hypergraph is the unique maximal sub-hypergraph such that no hypervertex in it has degree 
less than 2. The size of a 2-core is defined to be the number of hyperedges it contains. In the following, we 
occasionally refer to the 2-core as the 'core'. Its existence can be established through recursively deleting all 
vertices of degree less than 2 together with the associated hyperedges, which is the basis for the randomized 
peeling or leaf removal algorithm proposed by [KS81]. Algorithms equivalent to this one, as well as fc-cores of 
graphs and hypergraphs, have been studied as combinatorial objects, in connection to the XOR-SAT problem, 
and in the analysis of low-density parity-check codes over the binary erasure channel (cf. [DM08] and references 
therein for more background). 

We are interested in the ensemble Q = Q(n,m,v n ). Q is a collection of labelled hypergraphs in which each 
hyperedge a is a list (of size j a < L < 00 uniformly in n ) of not necessarily distinct hypervertices chosen 
uniformly at random and with replacement. For each n and j (3 < j < L) there are v„(j) hyperedges of size 
j and we have X^=3 V ™0) = n - Moreover, we assume that there is a non-negative vector v = (vo,3, ■ ■ ■ , v o.l) 
such that |v„(j) — nvo_j \ < cq < 00 uniformly in n. (See Section 5.4 for more details.) 

We are interested in the probability P no corc (n, p) that a random graph G in the ensemble G(n, \ np\ , v„) does 
not have a 2-core. Define V(a:) = ^j^jvjx^" 1 and let 

Pc = m{{p>0: V(C)/p> -log(l-C) VCe(0,l)}Al. (1.14) 

Consequently the set Z = {( e (0,1) : V(()//9 C = — log(l — ()} is non-empty and finite. We then write 
Z = {Ci, • • • , C|z|} with Q < Q for i < j and assume that 

V'iQ/pc < (1 - C)~ 2 for all (eZ. (1.15) 

Let #(•) be the inverse function of the monotonically decreasing bijection ( : [0,1] — ► [0,1], given by ((9) = 
exp{~ Jg r](s)~ 1 ds}. Here rj(s) — J2j= 3 jyj(s) and y(-) is the solution of the system (1.2) with 

F(x) = (-l + R'(l)( Pl -p ),-R'(l)Pi,-S3, -SL) f (1-16) 

yo = (/ie-"/", p c (l- e -^)- M e^ /pc , v f )\ (1-17) 

where p, = V(l), R(£) = J2j=3 5 j^^ 1 an d where 

Po = ^^> Pi= (X - X f' L . ■ *i = ^Pr- ^ 3<j<L, (1.18) 
E J= 3 OX, (e A - 1 - A) X\ = 3 JXj JJ J=3 j Xj 

and A is the unique positive solution of A(e A — l)(e A — 1 — A) -1 = (%2j- 3 jXj — X\ V O)^ 1 . 
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Theorem 1.3. Let p n = p c + rn 1 I 2 and suppose (1.15) holds. Then for any r\ < 1/4 
P„o corcK Pn) = *(ra h ) - b{ V$(ra h ) Qn" 1 / 6 + 0(n""), 



where <& and are defined in Theorem 1.1, while a^ ana 1 are respectively a and b o/ Theorem 1.1 where one 
uses N — \Z\, 9 l c — 9(^N-i), m, = e\ = (1,0,. . . ,0)t, yo defined in (1.17), p.s.d. symmetric matrix Qo wzi/i 
entries (3<i<j<L) 



f 0o,n = Pc7e~ 27 (e 7 -l + 7-7 2 ) 
Qo,i2 = -p c 7e- 27 (e 7 -l-7 2 ) 
Qo,22 = p c e^[(e 7 -l)+7(e 7 -2)- 7 2 (l+ 7 )] 

. <3o,ij = o, 

vector F defined in (1.16), and p.s.d. symmetric matrix G(x) with entries (3 < i < j < L) 



(1.19) 



Gu(x) 
G 12 {x) 

G 2j (x) 
Gij{x) 



R'(i)( Po + Pl ) + [R"(i)-R'(i) 2 ](Pi-Po) 5 

R'(l) Pl -[R"(l)-i?'(l) 2 ]( Pl -p )Pi 
Sj .(j-l-R'(l))( Pl - Po ) 

R'(l) Pl + [R"(l)-i?'(l) 2 ]p 2 
- Sj (j-l-R'(l)) Pl 



(1.20) 



Remark 1.4. Theorem 1.3 should be compared to Theorem 1 of [DM08], where the hyper-edges are restricted 
to have the same common size I < 3. For such hypergraphs it can be shown that the set Z has only one element, 
i.e. the critical solution y has only one critical time, whereas for irregular ensembles like Q(n,m,\i n ) there can 
be several critical times. Thus the general Theorem 1.1 is necessary for the proof of Theorem 1.3. See Remark 
5.5 for more details on the analysis of other initial ensembles. 

The framework of Section 5 and Theorem 1.3 have important consequences for coding theory, which are 
explained here briefly (cf. [RU08] for a comprehensive account). In the analysis of low-density parity-check 
(LDPC) codes over the binary erasure channel (BEC), there appears a natural hypergraph structure, where the 
ratio p = m/n is the transmission rate (equal to the noise level for the channel) and n is the transmitted message 
length. Peeling algorithm on this hypergraph corresponds to a certain decoding strategy and the presence of a 
2-core implies a decoding failure. Design of LDPC codes translates into the construction of hypergraph ensembles 
with maximal p such that the probability of decoding failure vanishes as n, the message length, tends to infinity. 
Such p is bounded above by the theoretical limit for the transmission rate over the channel, the so-called Shannon 
capacity, and the LDPC codes have been shown to achieve rates close to this limit. 

Even though the probability of failure vanishes as n — > oo for any p < p c (some critical threshold), it is still 
significant for message lengths encountered in practice, which could be n = 1,000 to 10,000, and so ultimately 
better estimates of this probability are necessary for successful code design. This motivated the conjecture in 
[AMRU09] about the finite size scaling behavior at this phase transition, which was subsequently rigorously 
proved in [DM08] for the ensembles of regular hypergraphs. 

The overall goal, however, is to design finite LDPC code ensembles that minimize the decoding error (cf. 
[AMU07] for a systematic approach to this task). This optimization gives rise to irregular hypergraph ensem- 
bles (that is with hyperedges of different sizes) with core size (proportion of hyperedges contained in the core) 
concentrating near several distinct values as n tends to infinity. This corresponds to the case of \Z\ = N > 1 
and is a consequence of the so-called matching condition (cf. [RU08, Section 3.14-4]), which translates into an 
integral condition for the generating function V, in terms of which the set Z is defined. It is thus necessary to 
deal both with irregular ensembles and with the case of N > 1, both of which goals are accomplished here. 

This paper is organized as follows. The proof of Theorem 1.1 spans Sections 2-4. Section 2 is devoted to 
preliminary results dealing with the loss of the density dependent property of the kernel at the boundary of D 
and establishing properties of the associated differential equations. Section 3 is an adaptation of the coupling 
arguments from [DM08], culminating with a coupling to a Brownian motion with quadratic drift. Section 4 
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contains the new arguments used to handle multiple critical times. Finally, Section 5 develops the framework 
for analyzing hypergraph 2-cores, where Proposition 5.2 is of particular interest, and concludes with the proof 
of Theorem 1.3. 

2 Asymptotic kernel and the associated vector field 

This section is devoted to preliminary results about the properties of the system (2.2-2.4) needed for our analysis. 
Throughout the paper we denote the open interior of a set A C R d by A° and define the e-thickening of that 
set by A e = {x € H d : dist(a;, A) < e}. We let d(z) = minj(rTi ■ z — e^) and it is easy to check that this 
equals dist(z, R <i \D) or — dist(z, D) depending on whether z lies in D or not. Greek letters r and a are non- 
negative integers denoting discrete time, whereas t and s are non-negative real numbers denoting continuous 
time. Finally, {q} are positive constants, possibly distinct in each proof. 

For x € D let A (a?) be a matrix of partial derivatives of F at x <G D, i.e. [A] ;, = d Xb F a (x), which is indeed 
well-defined for all x e D as F certainly admits a differentiable extension to D s . Also, define G(x) to be the 
covariance of the increment A with distribution VF(-|af), i.e. the p.s.d. symmetric matrix 

G{x) =J2 A (A) f VF(A|f) - F{x) F{x)l (2.1) 

A 

Let K : R d — > D denote the projection onto D, which was assumed to be convex and non-degenerate. We 
extend x^ W(-\x) as well as F, A, and G to the whole of R d defining W(-\x) = W(-\K(x)), F{x) = F(K(x)), 
etc. Note that VF(-|n -1 •) is now a transition kernel on ft d and that for all x E H d we still have F(x) and G(x) 
as the mean and variance, respectively, of the increment A with distribution VF(-|af). However, A(x) is now 
only a matrix of derivatives of F for x e D°. Finally, F, A, and <B are Lipschitz continuous on R d , as they were 
on D. 

For each p with \p\ < 5, we are interested in the following system of ordinary differential equations 

F{y{9)) > 0, y(0) = y p (2.2) 
A(y(0))^dO) e>(>0, B C (C)=I (2.3) 
G(y(9)) + My(O)MO) + Q(0)A(y(0))t 6 > 0, Q(0) = Q p . (2.4) 

The solution to equation (2.2) is the fluid limit mentioned in the introduction and the solution to (2.4) gives the 
covariance matrix of the diffusive limit. The matrix Bf (0) is the matrix of partial derivatives of the function y 
at time 9 with respect to the initial condition specified at time (. 

2.1 Properties of the ODEs 

The following lemma allows us deal with the fact that W(A|-) is not differentiable at x <G <9D. 
Lemma 2.1. The following hold with A = [0,0] x (—5, S). 

(a) There is a unique solution (0,p) t— > y{Q,p), (0,p) H^(9,p) and(9,p) Q(0 ;/ o) of (2.2-2.4) on A. These 
functions are Lipschitz continuous there and moreover there is a uniform (for (0,p) G A) bound on the 
operator norms ||B C (0, p)\\, ||B C (0, p)- 1 \\ and ||Q(0,p)||. 

(b) There are positive constants K\ and n 2 such that for all (9, p) G A 

< ni\p\ 2 + K 2 p P \p\, (2.5) 



ay 
d0 
dB c 

"dT 

dQ 
d0 



(0) 
(0) 
(0) 



y(6,p)-y(6,0)-pB (6,0) 



dp 



p=0 
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where \x p is the Lebesgue measure of the set {s € [0,6*] : y(s,p) <^ D}. Hence, a fortiori, the following 
partial derivative exists 



(2.6) 



p=0 



Proof, (a) Since F(-) is bounded and Lipschitz continuous in D and since for any \p\ < S the inital condition 
y p is in D°, it follows that the solution y(-, •) of (2.2) exists and is unique in A. Applying Gronwall's lemma we 
can deduce that the solution to (2.2) must be Lipschitz continuous with respect to its initial condition, so since 
p y p is differentiable, it also follows that y(-, •) is Lipschitz continuous in both coordinates. 

Next, recall that A(-) and G(-) are Lipschitz continuous, so in view of the established properties of y(-, •) we 
can deduce the same for (9,p) \— > A(y(9,p)) and (9,p) t— > Q(y(9,p)). Moreover, we have assumed p i— » Q p to 
be Lipschitz continuous. It follows that the linear ODEs (2.3) and (2.4) have unique Lipschitz continuous and 
thus bounded solutions on A. In fact, suppressing the dependence on p, we have the explicit respresentation 



Q(0) = B o (0)Q(O)(B o (0)) t + / B c (0)G(y(C))(B c (0))tdC. 



(2.7) 



It remains to be shown that ||B^(^, p) 1 || is uniformly bounded. This follows from the observation that 
detB c (C) = 1, while 

d(detB c (fl)) =(detBc(fl))tr[AWp))] 

and the entries of A(y(9,p)) are uniformly bounded on A. 

(b) In order to establish (2.5), let F e (-) be a twice differentiable (with Lipschitz continuous derivatives) 
extension of F(-) from D to R d . It follows from an elementary result on differential equations (c.f. [Har64, 
Corollary 4.1]) that the differential equation 



dy 
d9 



(9) = F e (y(8)) 0>O, y(0)=V P 



has a unique solution y e with (9, p) h-> y e (9,p) twice continuously differentiable on A and moreover 



(2.8) 



(2.9) 



In particular, y(9, 0) = y e {9, 0), since we assumed that d(y(9, 0)) > implying y(9, 0) G D, and for some constant 
«i > and all (?e [0,9] Taylor expansion gives 



y e (9,p)-y(8,0)-p^(9,0) 



(2.10) 



Now fix p and [s, u] C [0, 9}. If y(9, p) e D for all 9 e [s, u), then F(y(9, p)) = F e (y(6, p)) and Gronwall's lemma 
gives 



\\y{u,p) -y e (u,p) - [y(s,p)-y e (s,p)}\ \ < \\y(s, p) - y e (s, p)\\ e 



{u — «)C2 



(2.11) 



where c-i > is the Lipschitz constant for F e . On the other hand, we can also use the identity F(x) = F e (K(x)) 
to obtain 



\\y(u,p) -y e (u,p) - [y(s,p)-y e (s, 



< 



F e (K[y(s',p)])-F e (y e (s',p)) 

)C2 SU 

s'e[ 

< c 3 (u~s)\p\, 



ds' 



< (u-s)c 2 sup \\K[y(s' , p)]) - y e (s' , p)\\ 

s'£[s,u] 
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where the last inequality follows by Lipschitz continuity of both y and y e , together with an application of the 
triangle inequality 

\\K[y(s',p)]) - Us',P)\\ < ll%»)]) - K[y(s',0)]|| + \\y(s',0)-y e (s',p)\\ , 

and our assumption of y(s',0) € D for s' £ [0,6]. Since none of the constants Cj depend of p or [s,u], by 
continuity of 9 y(#, p) it follows that for all (9, p) e A 

\\y{9,p)-y e {9,p)\\<cj c -p p \pl 

which together with (2.10) gives the inequality (2.5). □ 

The following simple estimate will be used to approximate the mean and covariance matrix of the chain near 
the critical times. 

Lemma 2.2. Let 9 C e (0,9) and fix some positive integer < v < [n9 c \. Then there exist positive constants 
Ki and k 2 such that for any integer r with v < t < 2[n9 c \ — v 

ft. -. . . . S . . I ,. . {> 

./,,_n K 

<7—V 



V F(y(a/n, p)) - F (t - v)/n - F 1 f (s/n - 9 c )d 

Jv-0.5 

T-l 

^G(^(7/n,p))-Go(r-i/) 



< Kl (\v/n-9 c \ 2 + \p\)(T-v) (2.12) 



< n 2 (\v/n-9 c \ + \p\){T-v) (2.13) 



where F = §(9 C ,0) = F(y{9 c ,0)), F 1 = 0(0 C ,O) = A(y(9 c ,0))F(y(9 c ,0)) and G = G(y(9 c ,0)). 
Proof. By the Lipschitz continuity of (9, p) i— > y(9, p), x i— » F(a?) and a; i— » G(x) it is immediate that 

\P(y(a/n,p))-F(y(a/n,0))\\ < c Q [p\ 

\\G(y(a/n,p))-G(y(9,0))\\ < n 2 {\a / n - 9\ + \p\) 

(where the second inequality is for the operator norm) from which (2.13) follows. To obtain (2.12) recall that 
9 i ► y(9,0) was assumed to be twice differentiable with Lipschitz continuous second derivative. Consequently 
the following Taylor expansion holds 



f(CO)-f(fc,O)-(C-(9c)0(<9c,O) 



<c 1 \(-9 c 



and taking m — c V c\ we have 



r— l r— l 

]T F(y(a/n, p)) - £ [f - F^a/n - 9 C ) 



<Ki(W/n-9 c \ 2 + \p\)(T-v). 



Observing that f(a) = Sa-o t f( s )^ s f° r /( s ) = 1 anc ^ /( s ) = s / n ~~ yields the desired conclusion. 



(2.14) 



(2.15) 



□ 



In the next lemma we establish certain discrete approximations to the solutions of the differential equations 
(2.2)-(2.4). These approximations will be related to the mean and covariance matrix of the auxiliary process 
to be introduced in Section 3. Fix some J n C [0, n9]. Let /„ = [0,n9]\J n and define the matrices A r = 
(r/n, p))t T& i n . Further, let the matrix B£ = I for a > r while for < a < t let 



B T a = [I+-A T 
n 



1+ -A c 
n 



Finally, define the following sequences (as before a and r arc non-negative integers) 
y*{r + l) = y*(r)+A T (y*(r)-y(T/n,p)) + F(y( T /n,p)), y*(0) = ny p 



(2.16) 



Q*(r + 1) = nB5Q p (B5)t+ ]T W a+1 G(y(a/n, p))(W a+1 ^ , Q*(0) - nW Q„(B5)t, (2.17) 

o-e/„n[o,r] 

where we suppressed the explicit dependce of y* and <Q* on p. The following lemma is proved in Appendix A. 
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Lemma 2.3. There are positive constants K\, k 2 and K3 such that for all t € [0,n#], C € [0,r/n] and n large 
enough we have 

||n _ V*(T) -y(r/n,p)\\ < k^ 1 (2.18) 

< K2(l + |J n |)n _1 (2.19) 
||n _1 Q*(T) -Q(r/n,0)|| < k 3 (1 + | J„|)n _1 . (2.20) 

Moreover, matrices {BJ : < a, t < n9} and their inverses are uniformly bounded in operator norm. 
2.2 Fluid limit with error bounds 

The following is concentration inequality for a Markov chain with a fluid limit. Its proof isd a straightforward 
generalization of the proof of [DM08, Lemma 5.2]. For similar results for more general discrete-time processes 
see [Wor95] and for pure-jump Markov process see [DN08]. 

Lemma 2.4. Let z(-) be a Markov chain started at some initial distribution z(0) with kernel W(-|n -1 -). Then 
there exist constants n\, k 2 and k 3 , depending only on the constants K and 9, such that for all n and any 
< t < [nS\ : 

(a) z(-) is exponentially concentrated round its mean 

P„. p {||z(t)-E„ iP z(t)|| > r }< Ko expf-— ) , (2.21) 

(b) and the mean is close to the solution y(-) of the ODE (2.2) 

I \^u, p z(t) - ny{r/n) \\ < k 2 \ |E„, p z(0) - ny(0) \\ + k 3 ^nlogn (2.22) 



3 Coupling arguments 

In this section we construct three couplings that allow us to approximate the distribution of the min T < s d(z(r)) 
where z(-) is the Markov chain of Section 1.1. The methods of proof owe a substantial debt to those of [DM08]. 
We define an extension W n of the transition kernel W n by 

y 1 ' \ WiAzln^z) otherwise v ; 

It follows, that when started at the same initial distribution, the Markov chain with kernel W n and the one with 
kernel W n coincide until the first time they reach the boundary of nD. Furthermore, notice that our assumption 
(1.4) gives us the following estimate for the total variation distance 

W n (-\z) - W^n^z 1 ) < k 2 n- 1+E + k 7 n~ 1 \\z- z"\\ (3.2) 

TV 

where U.7 is the Lipschitz constant for W. The following lemma allows us to reduce the original problem to the 
exit problem for the Markov chain with kernel T / F(-|n -1 -). 

Lemma 3.1. Let z(-) and z'(-) be Markov chains in R d with transition kernels W n (-\-) and W(-|n -1 -) respec- 
tively and with common initial distribution z(0) = z'(0). Then there exists a coupling (,?(•), £'(•)) and positive 
constants K\ and k 2 , depending only on 9 and the constants k i; such that for all r > and any time 9 > 



P„ i sup I 

t r < n 



\z{t) - z'{t)\\ > rn e \ < exp{n £ (Ki - K 2 r)}, (3.3) 

l^r < n J 

where n = [n9\ . 
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Proof. We couple z(-) and z'(-) by first setting z(0) = z'(0) and then iteratively coupling for each r > 1 the 
increments Az — z(t) — z(t — 1) and Az' — z'(t) — z'(t — 1), so that 

P„{AzV Az'\z,z'}= W n (-\z)- Wi-ln^z') . (3.4) 

TV 

Define D T {\) = E„[e AZ ( r '], where Z(t) = sup CT<T \\z(a) — z'(a)\\. It is enough that we prove the bound 
Dn(n2) < exp{«;in e } for some positive n\ and K2- This proof follows like the proof of [DM08, Lemma 5.1], 
where we take c = c n 5 and allow the constants to depend on 0. □ 

As in the comments preceding Lemma 2.3, let J n C [0, nd], define /„ = [0, n8]\J n , and recall the definitions 
of A T and W a . We now suppose that | J n \ < n 13 for some £ (0, 1), where | • | denotes the Lebesgue measure on 
R, and define the auxiliary process 



z'(t + 1) = {A T - k T y{T/n, p)) + B t t z'(t), z'(0) 



where {A T }^L n are independent random variables with distributions W(-\y(r/n, p)). Let denote the dis- 
tribution of z'(-). 

Furthermore, suppose that there exist some positive constants cq and 7 and a slowly varying function L(n), 
such that for z(-), being the chain with kernel T / L(-|n~ 1 -) and z(0) = z„,p, we have 



Pn,p \ SUP 
{T<n 



z(t) -ny(r/n,p)\ \ > c n'L{n) \ < -. (3.5) 



Assume also, that for some ci < Co we have d(y(8,p)) > cin 1 L(n) uniformly in {9 : n6 e /„} and \p\ < S. Then 
the following holds. 

Proposition 3.2. For any S > max{7/2,27 — 1,0 + 7 — 1} there exist finite positive constants Hi and k 2 
(depending only on the constants ki, cq and c\ above) and a coupling of the process z(-), with kernel W(-|n~ 1 -) 
and initial distribution z n , p , and z'(-) with law P^,, such that 

P n , p ( sup \\z(t) - z\t)\\ > K.n 6 ) < ^ (3.6) 

for all \p\ < 8 and n. 

Proof. Observe that we can couple z and z' by taking z'(0) = z(0) and coupling the increments so that 

P ni „(A*(r) ^ A r |jT) = WWi-ln-'zir^-Wmr/n))^, (3.7) 

where as before Az(t) = z(t + 1) — z(t) and T T is the cr-algebra generated by {z(a), z'(a) : a < r}. 

We will now show that it is enough to consider the coupling only until the first time the process z(-) and 
the solution y(-) (suppressing dependence on p) are more than c^n 1 L(n) apart. Indeed let < n denote the 
first value of r such that ||<?(t) — ny(r/n)\\ > c n' ) L(n). Then by assumption (3.5) we have 

P„, P {r* <n}< n- 1 , (3.8) 

and it follows that to prove the statement of the proposition, it is enough to show that 

supP„ iP {T >t < n , \\z{t) - z'{t)\\ > K in 6 } < n~ 2 . (3.9) 

r<n 

This is achieved by separately bounding the ^-" s -martingale {Z s } and the predictable process {V s } in the Doob's 
decomposition of the adapted process N s = (Bo _1 ) _1 (z'(s) — z(s)) = Z s + V s , where Z = Vq = and 

AV S+1 = V s+1 -V s = K n {N s+1 -N s \T s } = (B s - 1 r 1 R(n- 1 z( S ),y( S /n),s) 
AZ S+1 = ^ +1 -Z S = (B^ 1 )- 1 {A S -Az( s )-E„[A S -Az( s )|^]} 
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for 

R(x',x,s) = F(x) + I s€ln A(x)[x' - x] -F(x'). 

Since A(-) is Lipschitz continuous and uniformly bounded on R d and since it is the matrix of derivatives of F 
at every x G D°, we have 

\\R(x',x, s)\\ < c 2 \\x' — x\\ 2 for s G /„ and x, x' G D°, 
| \R(x", x, 9)\\ < cs\\x' — x\\ otherwise. 

Now observe that by the assumption on d(y(-)) we have y(s/n) and n _1 2(s) in D for s G and n large enough 
as long as r* < n9. Thus by the bounds on R 

||AT4|| < I^^calln- 1 ^^) - y(s/n)\\ 2 + I^c^n" 1 ^) - y(s/n)\\ 

leading to \\V S \\ < c z \I n \{n<- 1 L{n)) 2 + c A \J n \(rF- 1 L(n)). 

To bound \\Z S \\ we use [DM08, Lemma 5.6] with U s = AZ S as follows. Throughout we take advantage 
of the uniform bound on ||(1Bq _1 ) _1 ||. Since both A s and Az(s) have uniformly bounded support, we have 
\\U S \\ < 4c 5 . Moreover on s < t, 

||E n [A:|^]|| < c 5 P„(A: ? 0\T S ) < cslln" 1 ^) - y(s/n)\\ < ^"^(n) 

implying 

F n {\\U s \\ > c 5 L(n)/n\F s ) < ¥ n (A* s + 0\f a ) < c ^- Y L(n) . 

Combining the two above estimates gives ||{/ s || < c^n 1 ^ 1 L(n) + Ac^Ia where P„(A) < c 5 n 7 ~ 1 L(n), making it 
clear that inequality [DM08, (5.19)] holds with T = c 5 n' y ^ 1 L(n). Consequently, applying [DM08, Lemma 5.6] 
with a (there) taken to be n v gives 

Pn(||^a|| > < 2dcxp{-n 2 ^L(n)- 1 /(2d9)} 

as long as n n < nTd — c?n 7 L(n), i.e. i] < 7. To get a nontrivial bound we also need 2n > 7, so that 
Pn(||Z s || > n v ) < rT 2 for 77 G (7/2,7) and n large enough. This gives the required conclusion for any 
<5 > max{7/2,2 7 - l,/3 + 7 - 1}. □ 

The process with distribution P^"* has independent increments for r G J n , which allows us to couple it with 
Brownian motion. Let us fix J n — L>fL 1 J^ l , union of finitely many intervals ,P n = [t^tJJ with = \n9 l c — n^J 
and T l n — \n9 l c + n^\. We define t° = t° = for notational convenience. Notice that | J n \ < 2NvP . 

Define X z (t) = \Fit 2 + ^fGlW l (t), where {W l (-)}f =1 are independent doubly infinite standard Brownian 
motions conditioned on W l {0) = and where F t = 0(0*) = A(y(0», 0))F(j/(6»*, 0)) and G t = <G(y(6£,0)). 
Finally, let 

Xi(r) = n^[X\n- 2 '\r 0.5 - <)) - X\n- 2 l\f n - 0.5 - «#))]. 

Lemma 3.3. Let {3 £ (2/3,1) and\p\ <nP ~ x for /3' <2fi—\. Then for any S > 3(3 — 2 there exist finite positive 
constants K\ and k 2 and a coupling of the process z'(-) with law P*"* and t/ie Gaussian processes {X^(-)} N , 
smc/i i/iai /or eac/i i = 1 . . .N we have 

P„,J sup I m, - {F'(L*J) - ^'(^)} - >«i«4 < - 

Proof. In the following, we assume that det G(y(0*)) 7^ for all i, as for each i with det G(y(9 l c )) — the result 
is just a statement about the approximation of a Riemanian sum by an integral (this is clear from the last 
paragraph of the proof). 
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First, notice that the chain z' {■) has independent increments for r € for each i = 1, . . . , N and moreover 
for i ^ j the increments in ,P n are independent of those in J^. Define random variables 7] = mj{z '(rj, + j) — 
£'(r^ + j — 1)} for 1 < j <T % n — T^ n , which arc uniformly bounded by l<! and have mean and variance 

E n7 j = mi • F(y(r/n)) and Var 7 ] = m|G(y(r/n))m, . (3.10) 

We can then apply Sakhanenko's refinement [Sak85] of the Hungarian construction to conclude that there exist 
independent real-valued Gaussian processes {b l n {-)}f =1 , with b % n {-) defined on J % n and satisfying b l n {T^ n ) — 0, such 
that 

P n J sup I m, • {z'(t) - z'iri)} - 6 l „(r)| > c logn 1 < ^. (3.11) 



T"6 J* 



7i 



The increments 6J,(r + 1) — 6J,(r), r € have mean and variance equal to the mean and variance of 7* 
The inequality (3.11) follows by applying Chebyshev's inequality to the result of [Sha95, Theorem A]). 
Thus for i = 1 ... N we have the respresentation 



T-l 



K(r)= £ m i .F(y{a/n))+B i £ mlG( y >/n)) 



m, 



where B % {-) are independent standard Brownian motions. Since the real-valued Gaussian processes {X^(t),t > 
r^} admit the representation 



X i n {t) = F i f ' (a/n-e^da + B^Giit-ri/n)), 

Jri-0.5 



to conclude the proof, it is enough to show that this coupling of b l n {-) and X^(-) is such that all n and i 

P„{sup K([t\) - Xl(t)\ > 3c 2 n 5 } < ^, (3.12) 

teJ*, n 

where the t in the supremum takes values in R. Since {X^(r + s) — X^(r) : s <G [0, 1]} have the same law as 
{B l {Gis) + a n T (s) : s € [0, 1]} for some non-random a„ T (s) bounded uniformly in s e [0, 1], n and r G we 
get (3.12) as soon as we show that 

sup P„{K(r) - X* (t)| > 2c 2 n 5 } < n- 2 . (3.13) 

This follows by standard Gaussian tail estimates from the fact that |E n 6^(T) — E„A^(t)| < C4(n 2 ^ _1 ) + 
n P -!) n /3 < C4n 3/3-2 an( j Var(6Jj(r) — X l n (r)) < c 5 (n /3_1 + n /3 ~ 1 )n 13 = cqu 2 ^^ 1 , itself a consequence of Lemma 
2.2 and the bounds on |J£| and □ 

All the above couplings are collected in the following proposition. 

Proposition 3.4. Let [3 e (3/4, 1) and \p\ < n 13 _1 for (3' < 2(3 — 1. TTien fftere exzsis a positive constant n, 
such that for 5 > (3/3 - 2) V e 



Pn {K„ > n 5 } - - < P(n, p) < P„ {K„ > -n 5 } + - 



where [K„], = m|z'(r^) +inf te ji A^(i) — dj and z'(-) is the process with law P™. 

Proof. Taking r = c 3 logn with some C3 > ki/k 2 in Lemma 3.1 we see that there exists a coupling of the 
Markov chain z(-) with kernel W„(-|-) and the chain z"(-) with kernel W(-|n -1 -) (both started with distribution 
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z(0) = z"(0) — z n ,p) such that with probability exceeding 1 — 1/ n the two processes are at most e„ = c 3 n e logn 
apart until time = min{r > : z(t) < 0} A \ n6\ and therefore 



P„ J min d(z "(r)) > e n \ - 1 < P n { min d(z(r)) > 



>„ | min d(z(r)) > o| 



< P n \ min d(z"(r) > -e„ I + - (3.14) 

Next, observe that by the bound of Lemma 2.4 and the assumption (1.7) the events {min TI£l7i d(z"(r)) < 
±e„} have probability smaller than c^/n (taking C3 big enough) and hence we can restrict our attention to 
min-rgj^ &(z"(t)). Transfering the couplings in Proposition 3.2 (whose hypothesis is satisfied for the J n we are 
considering here with 7 = 1/2 and L{n) — logn) and Lemma 3.3 onto one probability space, it follows that 



P n jmj^rjj + mf X\{t) - d, > n 5 , i < JvJ 



C5 
n 

< P„ { min d(z"(r)) > ±e„ 

[t£J„ 

< P„ f mt^(rj,) + inf X* (t) - d, > -n 5 , i < Tvl + — 

where S > (3/3 — 2) V e. Combining this with the estimate (3.14) yields the conclusion of the proposition. □ 



4 Weak convergence arguments 



Having reduced our problem to that of estimating the probability that the random vector K„ lies in a convex set, 
we can now leverage a convenient decomposition of K„ into a sum of independent vectors, that will enable us to 
apply the central limit theorem and subsequently use the Taylor expansion for a smooth probability distribution 
function in R N . Our methods here depart from those of [DM08] and seem to simplify the argument leading to 
a somewhat tighter bound on the error. 

To begin with, let us define the R N vectors with entries (1 < i < N): 



\T n ]i = ml{y*(rj l ) + E„, p z(0)-ny(0,p)} + ^n 1 / 3 4 -d, 
as well as symmetric matrices £„ whose entries are given by (1 < i < j < N) 

[S n ] w = mjQ^B^j)^ 
where B(i,j) = B=?. In what follows, also B(i,j) = B e i(6 J c ). 



(4.1) 
(4.2) 



Lemma 4.1. Let r G R and take p = p n = rn 1 I 2 in the definition of S„ and Y n . Then there is a positive 
constant k such that for all (5 £ (3/4, 1) and for all n 

Hrir 1 !!,, - SI 



< nn 



1/2 r„-rr|| < 



nri 



0-1 

3P-5/2 



(4.3) 
(4.4) 



Proof. We suppress the dependence of Q on p and start by expanding the difference 

n-^&Mijy - Q(flj)B(i, j)t = n-^iri) [B(i, j) - B(i,j)] f + [n- 1 ®*^) - Q(fl')] B(i, j)t 

to see that (4.3) follows from the estimates proved in Lemma 2.3 and the uniform bound on the norms of all 
matrices involved. In turn to see (4.4), note that the proof of Lemma 2.2 gives the bound 



[n9H 



™\v*{\nei\) - mjy*(ri) + F, ]T (a/n - 61) 



< con 3 ?- 2 , 
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a simple calculation confirms that 



whilc it is shown in Lemma 2.3 that 



-V*(L0>J)-y(Cp)|| <c in -\ 



and from Lemma 2.1 we have 



y(e i c ,p)-y(e i c ,o)- P f p (e i c ,o) 



< c 2 n fl ~ 1 \p\. 



Finally |E„ iP z(0) — ny(0,p)\ < k.-> by (1-7). Putting these estimates together, and using p„ = rn l l 2 and 
di = y(0i, 0), completes the proof, since for /3 e (3/4, 1) we have -1/2 < [3 - 1< 3/3 - 5/2. □ 

The next lemma is a bound on the distance between Gaussian distributions in terms of the differences 
between their means and covariances. 

Lemma 4.2. Let x and y be Gaussian vectors in R N with means a, b and positive definite covariance matrice 
A, B respectively. Let S > be a lower bound for the eigenvalues of both A and B. Then there exists a constant 
K depending only on N and S such that 

sup |P(x gC)- P(y e CO | < k {||a- b|| + ||A - B|| Jlog+(||A - B||)j , 
cec L v > 

where C denotes the collection of B or el-measurable convex sets in R N and log + x = \ logx\ for x > and equals 
otherwise. 

Proof. We couple x and y by taking x = Sz and y = Tz, where z a standard normal Gaussian vector in R N 
and S and T are symmetric positive definite square-roots of A and B respectively, i.e. SS^ = A and TT^ = B. 
Let a = ||S — T|| and let r = a^/3 log + a + ||a— b||. Observe that for a > we have 

P(||x-y|| > r) < P(||S -T||||z|| + ||a- b|| > r) < P(||z|| > ^3 log + a) < ma (4.5) 

for some m > 0, where we have used the standard Gaussian tail estimate P(||z|| > u) < PN-2(u)e~ u2 ^ 2 for 
some polynomial pn-2 of degree (N — 2) V with coefficients depending only on N. In turn for a = we have 
r = ||a— b||, so the probability in (4.5) is trivially 0. Next, conditioning on ||x — y|| < r and using monotonicity 
of measure gives 

P(x e C) < P(y G C r ) + /sia < P(y e C) + n 2 r + mat (4.6) 

where C r = {x e R N : dist(x, C) < r} and the last inequality for some k 2 > depending only on TV follows 
by a uniform bound on integrals over convex shells of width r (see for example [BR76, Corollary 3.2]). Since 
by symmetry the above inequality also holds with x and y swapped, to complete the proof we only need to 
show that ||S — T|| < Ks\\A — B|| for some positive K3 that may depend on N and S. As all norms in R N are 
equivalent, we will demonstrate this in Frobenius norm, defined by ||A|| FR = Tr(A 2 ). 

To this end, we write (S - T) 2 = OLO^ and (S + T) 2 = UDU\ where O, U are orthogonal and L, D are 
diagonal and positive. This is possible because S and T are positive definite and symmetric. Let the elements on 
the diagonal of L be k and on the diagonal of D be di, where di > 25, which follows from (S+T) 2 = A+B+2ST, 
where all the matrices are positive definite and S is a positive lower bound on the eigenvalues of A and B. Thus 

Tr[(A — B) 2 ] = Tr[(S-T) 2 (S + T) 2 ] =Tr[OLO*UDU*] 

= Tr[UOLO^UD] = ^ d t h > 2<5Tr[(S - T) 2 ] 
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demonstrating that ||S — T|| FR < (25) 1 ||A — B|| FR and concluding the proof. 



□ 



In preparation for the next proposition, we introduce a decomposition of K„ into a linear combination of 
random vectors Y„, Z„, H„, and X„ with entries (1 < i < N) 



[Y„] 4 ee Yl mlB^ +1 {A ff -F(y>/n))} 



0<a<Tl,aiJ n 



[Z„]i = mTB(0,i)(F(0)-E ni ^(0)) 



2 mjB^ +1 {A ff -F(y>/n))} 



<r(zJ n ,cr<r! i 



l 



Pt„]i = inf + 

where s l n = n^ 2 ^ 3 [n /3 + 0.5] and u n = n~ 2 / 3 [nP - 0.5], so that 

m, • z'(f n ) = m t • y* (d) + \Yn}i + [Z„], + n^ 2 [H„] 4 . 

and consequently 

K„ = r„ + Y„ + Z„ + n^H,, + n 1 / 3 X„. 



(4.7) 

(4.8) 
(4.9) 

(4.10) 



(4.11) 



The merit of writing K„ like this lies in the fact that Y„,Z„, and (H„,X„) are mutually independent and 
centered. 

Proposition 4.3. Suppose that £ is positive definite and fix /3 G (3/4, 1) and <5 > (3/3 — 2) V e. T/ien t/iere 
exists a positive k, such that for any (3 <G (3/4, /3o), ?7 < min{l/2 — 5, 1 — /3, 3/3 — 5/2}, r e E, p„ = rn^ 1 / 2 and 
n sufficiently large we have 



P, 



{K„ < ±n 5 } - P n {Y + n-^ 1 -' 3 )/ 2 ^ + n-^Xn < o} 



< /en - ", 



where Y is an independent Gaussian vector in EL urei/i mean rT and covariance matrix X. 

Proof. We rescale K„ by a factor of n -1 / 2 and observe that for any random vector y, independent from the 
random vector pair (xi,x 2 ), we have 

|P(zi<y)-P(z 2 <y)| < E|P(a;i <y\y)-V{2 2 <y\y)\ 

< sup |P(a;i <z)- P(x 2 < z)| . 

Thus conditioning on (H„,X„) (which is independent from Y„ and Z„) we will be done once we show that 



sup 



P„ {n-'/ 2 (r„ + Y„ + Z„) ± n 5 - 1 ' 2 < x} - P„ {Y < x} 
To this end first recall that by our assumption (1.8) we have 



(4.12) 



sup sup 

VeM N ,d x6E. d 



P„, P {n- 1 / 2 U {z(0) - E„, p z(0)} < x} - P {n 1 / 2 UC P < x} | < l^n" 1 



/2 



where ( p is a centered Gaussian vector in IL d with positive semi-definite covariance matrix Q p . Setting U = 
HJ„ = [miB(0, 1), . . . , mjvB(0,iV)]^, writing A„ = n 1 / 2 !!^ and conditioning on Y„ (indepedent from Z„) in 
(4.12) immediately reduces our task to that of showing 



sup 

xGffi™ 



P„, p {n" 1 / 2 (Y„ + A„) + n- l l 2 Y n ± n^ 1 / 2 < x} - P {Y < x} 



1G 



This follow will from Lemmas 4.1 and 4.2 once we prove that for N a standard normal vector in R we have 



sup 

xeR« 



V n , P [n- 1,2 {Y n + A„) < x} - P„ {V„N < x} 



< n 



(4.13) 



V 

v n v n - 



where Var(Y„ + A„) = S r , 

Since £ is assumed to be positive definite, by (4.3) for n large enough £„ is positive definite, too. Thus 
(4.13) is a consequence of 



sup 

CeC 



{n- 1 / 2 V" 1 (Y„ + A n ) e c| - P {N e C} 



< ri 



-1/2 



(4.14) 



where as before C denotes the collection of Borcl-measurable convex sets in R w and N is a standard normal 
vector in M, N . Indeed it is enough to consider the convex sets C = C(x) = {x' e R N : V„x' < x}. 

Thus we conclude by proving (4.14) with an application of [BR76, Theorem 13.3]. Namely observe that Y„ 
is a sum of n independent centered random vectors gfe in R N where 



[gk]i = U<nJkei n ^M n { A k - F(y(k/n))} 



and these vectors have uniformly (in n and p) bounded fourth moments (|A T | < ki). Similarly the sclf- 
decomposability of the normal distribution gives us i.i.d. centered Gaussian vectors such that A„ = J2k=i Sk- 
Hence applying [BR76, Theorem 13.3] to the independent vector array V^ 1 (g^, +§&), whose sum (over k for 
each n large enough) has a covariance matrix uIn, yields the required result. □ 



4.1 Proof of Theorem 1.1 

Combining the results of Propositions 3.4 and 4.3 we get 

Pn 0ex «(n, Pn ) = P„{r + Y + n-( 1 -«/ 2 H„ + n- 1 / 6 X„ >0} + O(n-") 

{$ [s-i/a ( r + n^l 2 [B. n + Xl 1 )] + n- 1 ^))] } + 0(n""). 



where [X^ 1 ^] 



,/3/2-l/3 



(X*(-* n ) -Fisl/2) and = inf te 



I X 1 (t) . Our normalization and centering 



ensures that (uniformly in n) 

E n [H„] i = 0, Var[H„] 4 <C, E„[XW], = , and Va r [X ( n %<C. 
We will now show that there also exist positive constants ci , C2 such that for all 1 < i < N we have 



E 



inf X l {t) + Fr 1/3 G 2 / 3 n 

*€[— s n ,u n ] 



< Cl F7^G 2/ \- 1 



E 



inf X\t) 
te[-s n ,u n ] 



< 



c2 ^-2/3 G 4/3 E|y| 2 K ^ 



where V = ini te ^X(t) for X(t) = \t + W(t) with W(-) a double-sided standard Brownian motion. 

In [Gro89, Theorem 3.1] it is shown that the distribution function of V is Fy(v) = 1 — JC(— v) 2 t v<n , where 
/C is defined in the (1.13) and this explicit formula follows from [Gro89, (5.2)] taken with c = 1/2 and s = 0. 
Integration by parts reveals that EV = —ft, for the ft defined in (1.12). 

As a consequence of the above and the Brownian scaling giving 

X l {t) = F~ 1/3 G 2/3 X(F 2/3 G; 1/3 t) 



(the case Gi = is trivial), it is enough to show that 

E|K-V| < cm- 1 

E|K| 2 < c 2 E|F| 2 <oo, 



(4.15) 
(4.16) 
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with V n = inf te [_5 re x/„] X(t), where X(-) is the same doubly infinite Brownian motion as in the definition of V 
and S n and U n are positive sequences increasing to infinity, such that T n = S n AU n > F^G^^n' 3-2 / 3 . To 
this end notice that (4.16) follows immediately from > V n > V. To see (4.15), define the random time 

T = mf{s > : X(s) = inf X(t) or X(-s) = inf X(t)} 

t £ R. t G R. 

and apply the Cauchy-Schwarz inequality to obtain 

V\Vn~V\ <E{|V n -F|l| T | >Tn } < ||K-F|| 2 P{|T| >T„} 1/2 < ||V|| 2 P{|T| >T„} 1/2 . 

In [Gro89, Corollary 3.4] it is proved that P {\T\ > t} < 1 e~ Aat \ which together with T n > if ^G^V" 2 / 3 
will imply (4.15) once we show that E|]/| 2 < oo. This is achieved by using the same estimate on P{|T| > t} 
and bounding as follows 

T(V<v)<f{ inf X(s)<v\ + Ao 1 e- Aot3 <\ inf W(s) < vX + V^ ' 3 < e^ 2/2t + A^e^ * 3 , 

so that taking t = ^Jv we can deduce that ¥(V < v) < C -1 cxp(— C|v| 3 / 2 ) for some C > and all v < 0. In 
fact, it follows that all moments of V are finite. 

To conclude, we use second order Taylor expansion for <J> and the bound < C3 for all n to obtain 

P(n, Pn ) = $ (rS-Var) - (e^a)* V0> (rE" 1 / 2 ^ ftr^ 1 / 6 + ^"^{(/J-iJ.-i/a.-i/e-i,-,}, 

where [A]; = Fr 1/3 G 2 / 3 . Finally, observe that since for /3 e (3/4, 1) we have 77 < 5/2 - 3/3 < 1 - (3 < 1/4 with 
5/2-3/4=1/4, the above holds for all rj < 1/4. □ 



5 Finite-size scaling for the 2-core in irregular hypergraphs 

This section is devoted to the analysis of the phase transition for the existence of the 2-cores in hypergraph 
ensembles and specifically the proof of Theorem 1.3. Section 5.1 defines the state-space paramatcrizing ensembles 
of interest. The peeling algorithm and the density dependent Markov chain deriving from it is treated in Section 
5.2. The asymptotic probability kernel VF(-|-) is furnished in Section 5.3. In Section 5.4 it is shown that the 
paramtetrization of the initial ensemble satifies the hypothesis (1.6)-(1.8). Finally, the proof of Theorem 1.3 is 
in Section 5.5. 

5.1 State-space of ensembles 

We model hypergraphs as bipartite graphs with the set of vertices partitioned into v-nodes and c-nodes (the 
configuration^ model). The hyperedges correspond to v-nodes and hypervertices to c-nodes, and the resulting 
hypergraphs is more precisely a hyper-multigraph. 

Fix integers K > 2 and L > 3. Every ensemble in the state-space is characterized by the non-negative 
integers 

z= (-21, ... ,z k +l-2) = (w,r) = (wi,w 2 , •■• ,wjr,T 3 , ... ,tl) , n, and m, 

and denoted Q(z) = Q(uj,t). We define d(r) = Y^j=^i T j an d t(t) = n — J2f=3 T ji where we drop the explicit 
argument, when it is clear from the context. 

In order for Q(z) to be non-empty, we require that J2iLi z i — m > J2j=3 T j — n an d either zk > 1 and 
Y^f=i — or zk = and J2iLi = An element in the ensemble is a bipartite graph 

G = (U, V3, V4, . . . , Vl\Rq, . . . , Rk] E), 
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where (denoting disjoint union by II) we have CIIV^II- • -UVl = [n] (called the set of v-nodes) and i?otl- ■ URk = 
[m] (called the set of c-nodes), and the cardinalities of these sets are \U\ — t, \Vj \ — tj, \Rq\ — m — Y^f=\ z i an d 
\Ri\ — Zi. Finally E is an ordered list of n — f edges 

E = [(ai,Oi), (ai,o 2 ), • • • , (a^ajj; (a 2l a h+1 ), (a 2l a h+2 ), (a 2 , a h+h ); 

(a n _f , aj 1+ ...+i n _._ 1+ i), . . . , (a„_f , a; 1+ ...+; n _.)] 

such that the pair (a, a) appears before (/3, 6) whenever a < j3. Moreover, each a € Vj appears in the list exactly 
j times, while none of the a G U appear in any of the pairs in E. Analogously, none of the a G Rq appears in 
E; each a G Ri for 1 < i < K — 1 appears in exactly i edges; however, each a G Rk appears in at least K edges. 
The total number of graphs in Q{z) is thus 

h{Z) = \Q{2)\=( m )( n )\d\f{^r Zi \co^[e K ^r\A (5.1) 
\Z!,..., z K , ■) \t 3 , -..,t l ,-J fj^ J 

where d = d(z) = d(f) - Y,f=i iz i and e fc( x ) = Y^iLk x% 



5.2 Exact kernel 



We consider the following graph-valued process G(-). We assume that the distribution of the initial graph G(0) 
is such that conditioned on {G(0) G Q(z)} (when this event has positive probability), it is a uniformly random 
element of G(z). At each time a = 0, 1, . . . if the set of c-nodes of degree 1 is non-empty, one of them is chosen 
uniformly at random. Let this c-node be a and observe that there is a unique v-node a, such that (a, a) G E. 
This edge and all other edges (a, ■) G E (edges incident to the v-nodc a) arc deleted and the graph thus obtained 
is denote by G(a + 1). Otherwise, that is if there are no c-nodes of degree 1 in G(a), we set G(a + 1) = G(a). 

We can now define the process {z(a) = (uj(a) , f(a)) , a > 0} on 2^ +i-2 , where Zi(a) = LUi(cr) is the number 
of c-nodes of degree i in G(a) (1 < i < K) and z K+ j_ 2 (a) = Tj(a) is the number of v-nodes of degree j in G(a) 
(3 < j < L). Notice that we have 



K 



^iZi(a) < d(f(er)) = ^jr^a), 

i=l 

and that f = t(t(ct)) = a A min{cr' > : zi(er') = 0}. 



j=3 



Lemma 5.1. The process z(-) is a Markov chain with kernel W n defined by: if z\ — 0, then W n (Az\z) = 1az=o/ 
if Z\ > 0, then W n (Az\z) = unless there is exists a unique £ G {3, . . . ,L} such that At^ = — (then in 
particular Ad = d(Ar) = — I) and we have 



W n (Az\z) = ^Niz'lz), 



where z' = z + Az and 



N(z'\z) = (t + 1)£\J2 



m 



9oi, 



T K z' 
• • • , qoK 



with r = n — r = n — X^j=3 T j an ^ 



7r(Q) = coeff 



ei(x) 



z i 

9i2, • • • ,qiK 



K-l 

II / 

3=0 



Z ' K - X )( Z ' K )«/.,! X(Q) 



(5.2) 



(5.3) 



UK 



(5.4) 



where I = So<i<j<if ~ an< ^ ^ e collection T> consist of all triangular arrays of non-negative integers 
Q = {qij > 0: < i < j < K}, such that qu = for 1 < i < K — 1 and the remaining entries satisfy the system 
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(writing z = m — Eili z i an< ^ analogously for z' a ) 



*0 



- l^j=l 10j 
^K 



9ij 



+ 9oi 



e; 



=0 Qji 



ZK-1 

z k 



-K-l 



1{K-1)K 



= Z 



K 



+ Ej=0 Ij(K-l) 
+ Ei=0 IjK 



and the inequalities 



d(r)-Ef=iizi > i-E 



0<i<j<K 



> £L«y for\<i<K 



(5.5) 



(5.6) 



Moreover, conditional on {z(a') : < a' < a}, the graph G(a) is distributed uniformly over Q(z) = Q(z(t)), 



i.e. 



P{G(a) = G\{z(a') : < a' < a}} = 



1 



Km) 



(5.7) 



Proof. Fix z = (oj,t) with z\ > together with z' = (u',t') and G' € G(z') such that transition happens with 
positive probability. Let N(G'\z) be the number of pairs of graphs G G Q(z) and choices of the deleted c-node 
from Ri that would result in our algorithm producing G' . Clearly, the following relations need to be satisfied 
(primed letters correspond to the analogous sets in G') 



Ri C (J Rj for < i < K, R' K C R K , 

3=0 

and for < i < j < K we let qtj = IR^H Rj\. In turn, let R* K C R' K n be such that each a € i?^ had its 
degree decreased during the algorithm step (i.e. k a > k' a ). It follows that 

K 

d>l>q K , q °i - and qKK - Z K 

j=i+l 

where the first inequality is a consequence of the fact that upon the deletion of a degree I v-node at least (j — i) 
edges of c-nodes in R^ n Rj disappear compared to at least one edge disappearing from all the c-nodcs in Rk ■ 
We proceed to compute N(G'\z). First, select a v-node a to add to G' among the t + 1 = n — a" elements 
of U; by our choice of z", the degree of a is £ and we select a permutation of its I sockets that will be used to 
connect to the c-nodes and create G G Q (z) . 

Second, we sum over the set V of all the possible arrays Q. For each possible pair in this set there are 
(""^g*^*) ways of selecting nodes in R' Q to be assigned to R\, . . . ,Rk', for each 1 < i < K — 1 there are 
) ways of selecting nodes in R' { to be assigned to Ri+i, . . . , Rk', while there are ways of 



i)qij edges among 



901, 

exactly ( 2 

selecting those c-nodes in R' K to be assigned to R* K . We have thus allocated Eo<i<j<if U 
the £ emanating out of a. 

Third, we need to select the precise number (> K — j) of edges from our v-node a that we will connect to 
each of the q^K c-nodes in R'^ n Rk', at the same time we select the number of edges to be added to each of the 
qKK c-nodes in R* . Since we allocate in this way exactly I = £ — J2 0<i< j <K (j ~ fylij edges (emanating out of 
a), this can be accomplished in exactly 7r(Q) ways. 

Fourth, recall that we are counting not as much graphs G as pairs composed of a graph G and a particular 
c-nodc from R' Q n R\ of that graph, the choice of which during the step of our algorithm would produce G' . 
There arc g i such c-nodes. We have now proved that N(G'\z') is indeed given by the formula (5.3). 
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We are now ready to conclude the proof of the theorem by demonstrating (5.2) and (5.7). Notice that 
N(G'\z') depends on G' only through z' and recall that the graph-valued chain starts at some distribution 
G(0) which is uniform conditioned on each ensemble Q(z(0)). Consequently, by induction on a, as long as 
zi(o-) > 0, we have 

V n {G(a + 1) = G'\{z(a') : < a' < a}} N ^ G '^ 



zx(a) h(z(a)) 



and this is the same for all G' £ Q{z'). (Factor of z\{a) comes from us counting the pairs: graph and 
distinguished c-node of degree 1.) Thus (5.2) is implied by the cardinality of that ensemble being h(z'). Finally, 
with the graph G(cr) being constant for all a with a > f , these arguments have shown that the property (5.7) 
holds for all a. □ 



5.3 Smooth asymptotic kernel 

We next demonstrate that there indeed exists a probability kernel W that approximates the kernel W n on the 
domain H(e), defined below. Adopting the convention v = (v 3 , . . . ,vl) £ R+~ 2 , while u = (u x , . . . ,u K ) £ ~R K 
and x = (xi, . . . , xk+l-2) € R K+L ~ 2 , H(e) is defined by 

L K 

H(e) ee { x = (u,v) £ R^ x R^T 2 : x K > e; 1>^^-; d(v) - ^ ix t > e } , (5.8) 
where R + = {iel:i>0} and as before we have d(v) = Y^j=z3 v j- 

Proposition 5.2. Fix e £ (0,1) and H = H(e). For x = (u,v) £ H and Az = (Aw, At) £ Z K+L - 2 define 
transition kernel W(-\-) as follows: W(Az\x) — unless there exists a unique i with At^ = — 1,=^, in which 
case 

W(Az\x)^pr( Z" 1 ) P r 1 pf---p|f (5-9) 
d(v) \q - l,qi,...,q K J 

with 

(i + l)x i+1 . x K ^ K x K \ 

Pi=- — -77^ forO<i<K-2; p K _ 1 = — jv. ttt = -77^, (5.10) 

d(w) (A — l)!e^(A)ct(i>) <x(i>) 

where A is i/ie unique solution of 

/(A) ee Ae *-/W = ^)-^^ (5.11) 

and w/iere = - X)jLi+i > l»=o / or * = 0, . . . , K - 1 and q K = £ + J2iLi = ^ _ Silo 1 * > 0. 

TTien £/iere exists a positive constant k 2 = k 2 (L, K, e) such that for any z £ 1< K+L ~ 2 n nH with Wj > 1 /or 
a// i we /iaue 

HWnM^-WOIn-^H^^kan- 1 . 

Proof. We begin by defining 



VZ-, ,...,z,i-/ 



„-m (/ _ 1) ,^ nto 1 (^)- A "coeff[e g (x)^,x^] (g 12) 
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where |7^| < Ci/n for some positive constant c\ = ci(e,L) and the equality follows from the estimate 

my 1 = (1) 

W J d(r) [d(v)Y-i [ +ln 1 

itself a consequence of e < d(tT') < d(v) < L for n~ x z £ H and n~ x z' e H. We also define 



where for each Q e V 



- 3 (o) - n ( ECq1 qij ) P=° 9ii + qKK )tw). 

and the equality follows by simple algebra. Notice that W n (Az\z) = ir-i(z',z)ir 2 (Q,z',z) for z and z' in nH. 
We proceed to make a series of estimates to find the asymptotics of 7Ti, 7T2, and 7r3. 

First, the condition L > £ > £ — I implies that \T>\ < L K and that ir 3 (q) is uniformly bounded for q € T> by 
some constant c 2 = c 2 (L), since we always have 7r(Q) < e L . It follows that each term indexed by Q in the sum 
over V in 7r 2 is at most of the size 

c 2 (nL)^o<i< j <KQii+QK-i ( 513 ) 

and we notice that inequality X^o<i<j<ii"0 — *)%' + (fific — ^ implies that for all q such that either q^ > for 
some i + 1 < j or X^iLi Q(i-i)i + 9^^" < ^ we have (5.13) bounded by c 2 L L ^ 2 n e ^ 2 . Consequently the sum over 
such Q's only contributes at most c 3 n e ~ 2 for c 3 = c 2 L r2+l ~ 2 . 

It remains to consider Q such that q^ = for all i + 1 < j and X)i=i + Q'Ji'Jf = ^- I n this case, we 

define q-i-i = qu-Ui for 1 < i < K and qx = qKK- It is straightforward to verify that with this definition we 
have <7j = — J^jLj+i > for i = 0, . . . , if — 1, while additionally q > 1, an d <7#: = + X^i *^ z « — 0- l n 
particular, g is unique. Moreover, £ = qK + qK-i and consequently ir(q) = coeff[ei(x) £ , x e ] — 1 and 773(9) = 
( 9K ~qK 9K )- AU of this impiies 

* ?) - ( Z1 ~ ]) ff i z °){ z \ ) ( qK " + qK ) + w> 

Vg - 1/ V^-i/ W-i + g*7 \ Qk J 
where |7n^| < c 2 L L ~ 2 n e ~ 2 . Rescaling by n~ £+1 and using Xi = n~ 1 z i we recover 

( '~ 1)! ? ( r' ?0 = f / _1 )*r i n*r*r i+,jf +7S ,) , (5.i4) 

j— z 

where | 7 n 3 ^| < (c 2 L L ~ 2 + 07)71" 1 and we have used the inequalities 

a? - c 5 ™ < t ^tt < and x\ 1 - cgn" 1 < -. V \, ' < x] 1 

for any < q < L A Zj with positive constants C5 = c$(L) and c 6 = c 6 (L). 
Consequently, since — Azi = — % for 1 < i < K — 1 implies 

AT-l if-l x-i 

i=0 i=0 i=0 

substituting this into (5.12) and combining with (5.14) yields the estimate 

W n (A m = W (A^ gggffi^4f^^ + 7 W (5.15) 
coeff[e_ftr(x) z K jX d ] A"« )f - 1+ ' K 
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for some [7^1 < c 8 (e, L)n 1 and to obtain result (5.9) it only remains to show 



coeff[eg(x)^xj e K {\)^-^ ^ 
coeff[e K (x)^,x 5 ] A^-i+w 



< c 9 n \ 



where the positive term ei^A) 9 *" -1 A k ik-i docs no t depend on n. 
To this end, note that for A > and integers t, s > 1 we have 



(5.16) 



Px 



(i, S ) = coeff[ eK (x)*,x s ]A s e K (A)- t -P A K^JV i = s , 



(5.17) 



i=i 



where {iVi} i.i.d. Poisson(A) random variables conditioned on being greater or equal to K. Hence, in view of 
—Azk = z K — z' K = Qk-i and d — d' = Kq K _i + q K , estimate (5.16) is equivalent to 



P\(z K + Az K , d - (Kqx-i + q K )) 
P\(z K ,d) 



< CgU . 



(5.18) 



This will follow from a local CLT for the sum, Sk of i.i.d. variables Xi = (JVj — £)/M2(K, A) for £ = d/zx 
similarly to [DM08, Lemma 4.5]. First, a straightforward computation reveals that the first two cumulants of 
iVi are 



Mi (if, A) 
M 2 (K,X) 2 



Ea(Wi) 



\e K -i{\) 



Var A (JV!) = 



A[Ae g - 2 (A)e x (A) + e g -i(A)e K (A) - Aeg-i(A) 2 
' /\ A i -' 



and similarly the higher normalized moments M q (K, A) = E A [A^i — Mi (if, X)] q /M2(K, A) 9 , g > 3 can be com- 
puted. It is clear that each M q (K, A), g > 1 is bounded away from zero and infinity for A bounded away from 
zero and infinity. Second, recall that by our assumption of rT x z e H we have K + e/ p < £ < L/e and this 
implies that the unique non-negative solution of Mi (K, A) = £ (whose existence is shown as part of the proof of 
Lemma 5.3) is also bounded away from zero and inifinity. It follows that each moment M q (K, A) is uniformly 
bounded on H, as well as that p\(zx, d) = P(5fe = 0) and p\(zx + Azx,d — (Kqx-i + qx)) = ^{Sk' = a) with 
k = z K , k' — k = Az K e {— (L — 1), . . . , 0} and a = —(Kq K _i + q K + ^Az2)/M q {K, A). The last quantity, a is 
uniformly bounded on H and in the lattice of all possible values of Sk, which has span b = M^{K, A) -1 . At this 
point the proof is completed with exactly the same arguments as that of [DM08, Lemma 4.5]. □ 



Lemma 5.3. Functions x i— > pi, i = 0, . . . , K are three times differentiable for allx £ H with bounded continuous 
derivatives. 

Proof. Fix e > in the definition of H. We choose a < S < e and in the remainder of the proof consider H 5 , 
which will conclusively demonstrate differentiability at the boundary as the formulas for pj remain well-defined 
on H 5 for each Az (though this extension does not lead to an extended probability kernel, as for example pi < 
for z with X\ < 0). 

It is enough to show that partial derivatives (of all orders) of x ^> pi for < i < K exist in H 5 . This is clear 
for i < K — 2, whereas for i = K — 1, K we notice that function A A^/e^ (A) is bounded and smooth on any 
compact subinterval of (0, oo), so that we only need to show that x i— ► A(x), defined to be the solution of (5.11), 
is bounded away from and infinity (uniformly on H ) and infinitely differentiable. 

To this end, we first note that extending / to the whole interval [0, oo) by setting /(0) = K yields a 
monotone increasing, infinitely differentiable function from [0,oo) to \K,oo) and its first derivative is /'(A) = 
[(eif~i(A) + Aex-2(A))e_R-(A) — Ae^_i(A) 2 ]/ex(A) 2 , which is bounded below by some positive constant (compare 
coefficients in front of powers of A). As a consequence of the inverse mapping theorem, f~ x is well-defined 
and infinitely differentiable on [if, oo), which further implies that A(-) is well-defined, bounded and infinitely 
differentiable on H s with bounded derivatives, because xk > e — <5 > there. This completes the proof. □ 
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(5.21) 



To determine the mean F and covariance matrix G of the increment with distribution W(-|:c), observe 
that the random variables — (AT3, . . . , Atl) = —At have a multinomial law with parameters {1;S3, . . . ,Sl} for 
Sj = jvj/d(v) and conditionally on At we have (q — 1, qi, . . . , q K ) distributed with a multinomial law with 
parameters {£ — 1; po, • • • , Pk }, where i equals the unique j with Atj = — 1. A simple computation then reveals 
that F= (F( u ),F("))t for 

F^(x) = R'(l) (-^y +P1-P0, P2-P1,..., Pk-i-Pk-2, -Pif-i) (5.19) 
F(")(f) - (-s 3 , s L ) (5.20) 

where R(£) = E^U^- 7 '" 1 , so that R'(l) = d(w)" 1 Ej =3 j(j - whereas G(x) = Cov(Az) with Az = 

(Aw, At) is determined by (1 < i < j < K, 3 < I < k < K ) 

{ Cov(A^,A^) = R'(l)(p i l i ^ + Pi-i)+ [R"(l)-(R'(l)) 2 ](p,l l ^-p,-i) 2 
Cov(Aui,&jj) = R / (i)Mi= J --i + [R"(i)-(R'(i)) 2 ](P,-p t -i)(P J i^-P J -i) 

Cov(A^,At ; ) - 8i[i-l-R'(l)](pili 7 tjf-pi-i) 
Cov(AT;,ATfe) = -S;Sfe + s fc l;=fc 

5.4 Ensemble Q(n,m,y n ) 

In Section 5.2 we have only assumed that G(0) is such that conditioned on {G(0) € <?(•?)}, it is a uniformly 
random element of Q(z). This condition allows for variety of initial ensembles. In this section we define a 
particular initial ensemble. 

The inital ensemble Q = G(n,m,v n ) is characterized by non-negative integers n, m, {v„(j')}^ =3 and L > 3. 
We require that n = E^=3 v «(i) an d tnat there exists a distribution vector v — (vo3, • • • , w ol)^ an d a constant 
Co such that for each j we have |v„(j) — nvj\ < Co uniformly in n. For simplicity we also assume that v„(j) = 
for all n if vj — (this can be easily lifted, but the notational burden involved will obscure the issue). 

Let V(x) = X^=3 jvojX-' -1 and note that the asymptotic average degree of the v-nodes fj, = V(l) is indepen- 
dent of n. An clement in the ensemble is a graph 

G=(V 3 ,V 4 ,...,V L ;C;E), (5.22) 

where V3 II • • • II Vl = [n] is the set of v-nodes, with \Vj\ — v„(j), C = [m] is the set of c-nodes, and E is an 
ordered list of edges 

E = [(ai,Oi), (ax, 02), • ■ • , (ai,a h ); (a 2 ,a h+1 ), (a 2 ,a h+2 ), (a 2l a h+h ); • ■ • ; 

(a n ,ai 1+ ...+i n _ 1+ i), .. ., (a n ,ai 1+ ...+i n )] 

where a couple (a, a) appears before ((3, b) whenever a < [3 and each a £ Vj appears in exactly j pairs. The 
total number of graphs in this ensemble is 

\Q(n,m, Vn )\ = ( n Vn(L) )coeff[( e x ) m ,x"] (h„)! (5.23) 

where h„ = ^ =3 jv n (j). 

To sample from this distribution, first partition [n] into disjoint sets {Vj}f =3 and attribute j sockets to 
each v-node in the set Vj. Second, attribute k a sockets to each c-node a, where fc a 's are mutually independent 
Poisson(/i//?) random variables, conditioned on the event {J2a.ec ^ a = n M}- Finally, connect the v-node sockets 
to the c-node sockets according to an indepedently and uniformly chosen permutation of {1, . . . ,nfi}. This 
sampling procedure is used to establish the approximate mean and covariance matrix for the initial state 
z = (65, f) of our Markov chain. Clearly we only need to deal with w(0) = Lo n , P , as t(0) is non-random with 
entires t,(0) = v n (j) for 3 < j < L. 



24 



Lemma 5.4. Letuj rliP = (0)1,0)2) denote the number of c-nodes of respectively degree 1 and degree strictly greater 
than 1 in a graph G chosen uniformly at random from the non-empty ensemble Q(n, \np\,\i n ). Then, for any 
e > 0, there exist finite, positive constants Ki such that for all (allowed) n, all r, N , and p G [e, 1/e] 



||Ew„ jP - nv,p\\ < n 
P{||w„, P - Ew„, P || > r} < K X e 



sup sup 

veM N x2 SeR 2 



P{Uw„ jP < x} - V{n 1/2 V( p < x} 



< n 3 ri 



-r 2 /(K 2 n) 
-1/2 



(5.24) 
(5.25) 
(5.26) 



where Q is a Gaussian vector in K 2 with mean u p and covariance Q p where 



[Qp]l2 



= Pe 



(pe 7 , p(l - e 7 ) - pe 7 ) f 
pe- 2 -<(e-<- I + 7-7 2 ) 
- Ale - 27 (e 7 -l- 7 2 ) 

27 [( e 7 -l) + 7 ( e 7 -2)- 7 2 (l + 7 )] 



where 7 = V(l)/p is the average degree of a c-node and p = V(l) is the average degree of a v-node. 

Proof. Given n and m — [np\ , by construction the initial distribution of the v-nodes only affects the distribution 
of the c-nodes through the total number, h„, of the v-node sockets, which has to match the number of the c- 
node sockets. Moreover, we have |h„ — np\ < c . Hence the analysis reduces to that for the regular hypergraph 
ensembles, as in the proof of [DM08, Lemma 4.4]. We follow that proof (subtituting each occurence of nl with 
that of h„ and using the approximation by np), thus demonstrating (5.24) and (5.25), which correspond to 
[DM08, (4.18) and (4.19)] respectively, until the end of the paragraph containing the estimate [DM08, (4.25)]. 
The conclusion of that paragraph is that for each < e' < 1 there is a positive constant c = c (e') such that 
for each 76 [e', 1/e'] we have 



\~P(z) -G 2 (z\n'u p ;n'Q p ) 

z£Z 2 



< CqU 



-1/2 



(5.27) 



where n' = m/p, P is the probability distribution of the vector z n ^ p , and G2(-\x; A) is the Gaussian density in 
K 2 with mean x and covariance matrix A. 

At this point we diverge from the proof of [DM08, Lemma 4.4] to prove the stronger conclusion (5.26). 
Clearly (5.27) implies (from now on supressing dependence on p) 



sup sup 

veM Nx2 sen 2 



P{Uz„ < x} - G 2 (z\n / u;n'Q) 



Vz<x 



< CqU 



-1/2 



(5.28) 



Let x and y be Gaussian vectors with mean and covariance matrix pairs (n'u/y/n,n'€i/n) and (nu/y/n, nQ/n) 
and notice that, since Q is positive definite, the lowest eigenvalues of both the covariance matrices are bounded 
from below by a positive constant uniformly in n and 7. Then, since \n — n'\ < p < 1/e, an application of 
Lemma 4.2 to the family of convex sets C = {n~ 1 / 2 C : C e C, n > 0} gives a positive constant C\ such that for 
all n and 7 



sup|P„{V^xe C}-P„{Vny E C}\ < c^ 1 / 2 , 
etc 



(5.29) 



implying that we can exchange n' for n in (5.28). Now, the sets CVj.i?,™ = {z E R 2 : U(i*— nu) < x} are certainly 
convex and Borcl- measurable, so the proof will be complete upon demonstrating that for some C2 > 



sup 

CeC 



J2G 2 (z\d;nQ) - f G 2 (z|0;nQ)dz 



< c 2 n 



-1/2 



(5.30) 
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holds uniformly in n and 7. This is a consequence of the Euler-MacLaurin sum formula, as stated, for example, 
in [BR76, Theorem A. 4. 3], where we take / to be the Schwarz function G 2 (- | 0; Q) and where the first correction 
to the Gaussian distribution in Ai(x) of [BR76, (A. 4. 20)] is at most c^n^ 1 / 2 . □ 



Notice that for K = 2 the ODE system dy/d0 — F(y) where y = (u, v) becomes 



dui 

Is 

du 2 
~dd 

dVj_ 

dO 



= -1 + 



R'(l) 



d(v) 
R'(l) A 2 



A 2 



d(v) e 2 (A) 



62 (A) 



"2 



u 2 - Ui 



(5.31) 



d(v) 



for j = 3, . . . , L 



with the initial condition y(0) = y p = (u p ,vo)- (For p = p c , this is exactly the system defined by (1.16) and 
(1.17).) Its solution is given in Lemma B.l. 



5.5 Proof of Theorem 1.3 

Fix < 770 < p c - Let denote the first time at which zi(t) < 0. Since the 2-core is the stopping set including 
the maximal number of v-nodes, Lemma B.2 implies the existence of constants K\, k 2 such that for all p > i]on 
the probability that a random hypergraph from Q{n, \ np\ , v„) has a non-empty core of size less than Kin is at 
most K 2 n 1- '/ 2 . Thus setting r\x = (Kiryo A r/ )/2 and P(n, p) = P„{min < T <„(i_^ 1 ) ^i(r) < 0} we have 

|P n {r* <n-l}-P(n,p)\ < ^n" 1 / 2 , 

and fixing in the remainder of the proof the domain H = H(?7i), the statement of the theorem will follow from 
Theorem 1.1 once we show that its hypothesis holds. 

Let n = (ni,ri2, ■ ■ ■ , n^), where n\ = n 2 = and rij = l Vj =o f° r 3 < j < L, and let d = \\n\\ € Z. Then 
A = {x e 1R L : n ■ x = 0} = R d and the image of H n A under this last vector space homomorphism is a 
non-degenerate compact convex set D C R d . To demonstrate the hypothesis, observe that by Lemmas 5.1, 
5.2, and 5.3 our original chain has the countable statespace 1i d n nD, is bounded and density dependent, with 
a sufficiently smooth kernel. Moreover, Lemma 5.4 states that the parametrization of the intitial distribution 
satisfies the relevant conditions for Theorem 1.1 (where the p in Theorem 1.1 is (p — p c ) for the p in the Lemma 
5.4). It remains to be shown that the critical solution of the fluid limit ODE has the required properties and 
that the relevant matrix X is positive definite. 

We start with the former. The solution of 5.31 in Lemma B.l reveals that 6 = {9 € (0, 1) : xi(6, p c ) = 0} = 
{6 e (0, 1) : V(((6))/p c = -log(l - ((9))}. Since g(() = -log(l - C) is analytic on [0, 1) and lim cn g(Q = 00, 
so it can only coincide with the polynomial V at finitely many points in (0, 1). Hence O = {9 l c : i = 1, . . . , N}, 
as defined in the statement of the result. By the first paragraph of the proof wc must also have 9^ < 1 — 77 
and we can now set 9 = [0f + (1 - r?i)]/2. Differentiability of (0 p) i-> V(C)/p + log(l - C) on (0, 1) x [e, 1/e] 
and the inverse function theorem imply that V'(£) = g'{Cj = (1 — C) f° r each (6 2. Since we assumed that 
V"(Q/p c < (1 — ()~ 2 for each £ £ Z, inspection of the solution y — (u, v) in Lemma B.l and elementary calculus 
allow us to conclude that the critical solution remains in D° until time 9 (u 2 (9) > for 9 £ [0,0], same for Vj 
with v j > 0) except at times £ O, when it is tangent to the plane X\ = (and no other face of D), moreover 
satisfying the tangency conditions in (1.5) with di = and rfij = e*i. 

The proof will be complete once it is demonstrated that the p.s.d. symmetric matrix with entries (1 < i < 
J<N) 

[sk = [Qra(B e ^))t] n , 

is actually positive definite. Here Q and B are defined as in Theorem 1.1, where we take all the chain specific 
data (F, G, y and Qo) like in Theorem 1.3. Notice that £ is the covariance matrix of the centered Gaussian 
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vector Y in E w with entries [Y]j = X^i=i[^(^i)s»]ii where B(i, j) = H 9 i (9Q and {gj} are independent centered 
Gaussian vectors in R K+L ~ 2 with covariance matrices 



Vj = B c (^ c )G(y(C))(B c (^ c ))tdC + Q(0)l j=1 . 



It is now enough to show that [B(j, j)VjB(j, j^jii = [Vj]n > for every j = 1, . . . , N. Since det B c (6<) > 
and G is p.s.d., this will follow once we show that [B(e)G(a?)B(e)t]n > for all < e < e and x G D, where 
B(e) = H5 g j_ e (9i). This last claim is a consequence of Gn(x) > for x e D, B(0) = I and the continuity of 
B(-). □ 

Remark 5.5. We briefly explain, using the example of ensemble Q = Q{n, p, u£, v„), how to analyze other 
ensembles within this framework. Ensemble Q is defined similarly to Q(n, [pn\,\/ n ), except that the degrees of 
hypervertices are given by a distribution u£. There are exactly u£(i) hypervertices with degree i and we assume 
[P n \ = Ya^i 1 u n(*); Hf=\ * u £(*) = J2j=3 and |u£(j) - nu p j\ < c uniformly in n for some strictly 

positive vector u p — (u Pi i t . . . ,u Pi k), such that the map p i— > u p is twice continuously differentiate. 

To establish the FSS for the phase transition in this ensemble, first, notice that the proof of Proposition 5.2 
also establishes the simpler (do not require the CLT arguments) claim that the kernel W n is density dependent 
on 

f L K ) 

H(e) = < x = (u, v) E R+ x R^T 2 : 1 > ^ v 3 > e; d(v) = ^ ix t \ (5.32) 

{ 3=3 »=1 J 

considered as a subset of the unique (K + L — 3) -dimensional affine hyperplane that contains it (it follows 
from the definition of W n that a chain with this kernel remains in nH if started there). The asymptotic kernel 
in this case, W{-\-) is defined similarly to W{-\-), where we substitute H for H, take $k-i = Kxk / d(v) and 
Ik = Pk = ^ = 0, impose i = — ^ i=1 iAzi — X^o 1 1i> an< ^ interpret Pjf = 0° as 1. Second, the associated 
differential equations are given in terms of F and G computed in (5.19) and (5.21), respectively, and the initial 
condition is now y p = {u p ,vq) and <Q P = 0. The solution of such a system is provided in [LMSS01, Appendix 
BJ. The absence of small cores in this ensemble can be established with a variation of the proof of Lemma B.2. 
The analog of Theorem 1.3 for this ensemble now follows by essentially the same proof as above. 
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A Appendix: Proof of Lemma 2.3 

The matrix A(y(r/n, p)) and so also matrices A r are uniformly (in r and p) bounded in the operator norm, 
which implies uniform (in a, r and p) boundedness of ||B£|| and for n > n , also the existence of the inverses 
(BJ) -1 and a uniform bound (in a, r and p) on their norm. 

Moving on to demonstrate (2.18), let D n (r) = n~V*(r) — y(r/n,p), so that D n (0) — and D n (r + 1) = 
W T D n (T) + ^ n (r), where 



F(y(T/n,p))-F(y(6,p)) 



&0. 



The Lipschitz continuity of F(-) and y(-,-) together with the supremum bound on the integral give the uniform 
estimate 

£i(t) < c n~ 2 . 
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With D n (r) = J2a=o ^cr+i^i( r ) f° r t < n8 and BJ uniformly bounded, as explained in the first paragraph, we 
can deduce (2.18). 

To see that (2.19) holds, define D' n (a,T) = \\Bl~ 1 - B CT /„(r/n)||, so that in view of B^" 1 = B a/n (cr/n) = I 
we have D' n (a, a) — 0, while for r > a 

D' n (a,r + 1) < D'^^ + n- 1 A T Bl~ 1 + n [B ff/n (^±l) - B„ /n (£)] 



/: 



< -D^(cr, r) + n -1 sup 



A T B- 1 + A(#,p))B ff/ll (e) 



c!0 



a t b;- 



A(y(6,p))B„ /n (6) 



(A.l) 



Next, recall that A T = A(y(r/n, p))l 7 - e / ?i , so that boundedness (of A) and Lipschitz continuity (of y and 
give 



sup 



A r - A{y{6,p)) < c 1 n- 1 t TeIn +c 2 t Te j n . 



From the ODE (2.3) and that same boundedness of A it is apparent that 

||B C (0)-B C ,(0O||<C 3 (|0-01 + |C-C'I)- 



(A.2) 



(A.3) 



Uniform boundedness of all the matrices involved and the inequalities (A.2) and (A.3) imply that for 6 e 
[r/n, (r + l)/n] we have 

A T B;- 1 + A( ? 7(0,p))B ff/ „(0)|| < C4 ^(a,r) + c 5 n- 1 + C6 l TeJri 

which combined with the bound in (A.l) results in the estimate 

D' n {a, t + 1) < (1 + an-^D'^a, t) + c 5 n- 2 + c 6 n- 1 l TeJn 

and hence 

max D' n (a, t) < c 7 (l + |J„|)n _1 . 

0<a<T<n6 

Next, observe that for any d x d matrices V, HJ and V, V we have 

||VW t -V'U'OO^I < ||V-V'||||UV t || + IIV'UIWV- V'|| + ||V'||||U-U'||||V'||. 

Applying this estimate (first with U = V = Q(0,p), V = B (r/n) and V = BJ, and then with U = 
G(y(C,p)), U' = G(y(cr/n,p)), V = B f (r/n) and V = B[ nC] ) and subsequently using the bound (2.19), 
uniform boundedness of all the matrices involved and Lipschitz continuity of ?/(-, •) and G(-) gives 

||Bo(T/n)Qp(Bo(T/n))t-B5Q^(B5)t|| < c 8 (l + | J^n" 1 
||B c (r/n) G(y(C,p))(B c (r/n))t - B£ +1 G(^/n,p))(B; +1 )t|| < Cg (l + |j n |)„-i 

uniformly in r and £ < r/n. Finally, noticing that the sum over a G J„ n [0, r] in the definition (2.17) differs 
from the analogous sum over a € [0, r] by at most |~| J„|] uniformly bounded terms and comparing that last sum 
with the integral in (2.7) gives the desired result. □ 



B Appendix: ODE solutions and absence of small cores 

Here we provide solutions to the system of differential equations (5.31) that arose from our analysis of the 2-core 
problem in hypergraphs. This is a special case of the result obtained in [LMSS01]. 
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Lemma B.l. There exists a unique solution (6,p) \— > y(9,p) = (u(6 , p) , v{6)) in C 3 ([0, 9] x [— S, 5],R L ) to the 
ordinary differential system (5.31) with the initial condition y p — (u p ,Vo) and it satisfies the relation 



Ul (9,p) = h(e)[C(e)-l + e- h ^/P\ (B.l) 
u 2 (6,p) = pe- h ^'Pe 2 {h{9)/p) (B.2) 

where h{9) = V(C(6>)) and ((9) = cxp{- j° d(w(s))- 1 ds}. 

Proof. The existence and smoothness property of the solution comes from the corresponding properties of F, 
which are a consequence of Lemma 5.3. In order to prove that (B.l) and (B.2) indeed give a solution, we 
start by making the substitution Xi{9, p) = Xi(Q, p) for the monotonically decreasing continuously diffcrcntiablc 
function ( : [0, 1] — > [0, 1] defined in the statement of the lemma. Notice that £(0) = 1 and limefi ((9) — 0, since 
dd(v(s))/ds < —3 and lim s fi d(v(s)) = 0, so that ( is a bijection. 
Next, substituting 

x 1 (C,p)=V(C)[C-l + exp(-V(C)/p)] and x 2 ((, p) = pe v ^ e 2 (V(C)/p) 
for respectively x\ and x 2 in (5.11) with K = 2, we can verify that yj(() — yj(C — 1)C J ! which implies 

L 

d(i7(C)) = E^(0 = V(C)C and A(f(C),y(C))-V(C)/p. 

J =3 

Moreover, dC/d6» = -1/V(Q and R'(l) = V'(C)/V(C), so substituting for A, xx and x 2 in the ODE system w.r.t. 
9 reduces it to the following systems w.r.t Q 

^(0 = -V(C) + V'(C)e- v «)/"V(C)/p-V'(C)[C-l + e - V(<:)/p ] (B.3) 

^(C) - -V'(C) e - v ^V(C)/p (B.4) 

Simple differentiation now shows that the functions x\((,p) and x 2 ((,p) indeed satisfy this system, together 
with the initial condition. □ 

From our definition of the domain D it is clear that the Markov exit problem yields information about the 
existence of large 2-cores, that is 2-cores of size at least nn for some positive n. In the language of coding theory, 
this means that it detects large decoding failures (lying in the so-called waterfall region). In principle, there 
could exist cores of smaller than linear size (corresponding to decoding failures in the error floor region) . It 
is a consequence of the results in [OVZ05] that in ensembles with minimal v-node degree greater than 2 the 
probability of such occurence tends to zero as n increases. 

Here we prove an explicit bound on this small probability for our ensemble Q(n, to, v„), which is accomplished 
by estimating the probability that the hypergraph contains a small stopping set. A stopping set is defined to 
be a subset of the v-nodes such that the restriction of the hypergraph to this subset has no c-nodes of degree 1. 
Thus the 2-core is a stopping set including the maximal number of v-nodes. 

Lemma B.2. Consider the ensemble Q(n 7 \ np\,v n ) and let I = min{i : v ^ > 0}. Then for any e > there 
exists finite positive constants «i(e) and K 2 (e) such that for any p > e the probability that a random hypergraph 
from the ensemble Q(n, [np\,v n ) has a stopping set consisting of fewer than Kin v-nodes is at most k 2 w 1- ^ 2 - 

Proof. Write to = [np\ . For a positive k to be chosen, we will bound the expectation of the number S of 
stopping sets consisting of fewer than nm v-nodes. Indeed 

rem Lv [_d/2\ 

E « 5 = E E E p ^ ( R5 ) 

v—l d—lv r—1 
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where P v ,d,r is the expected number of stopping sets consisting of exactly v v-nodes, with total degree d and 
connected to exactly r c-nodes. We have the explicit representation 



Pv,d.r = coeff 



[J(l + yz^) v »W,yV 



1 i m 

m d \ r ) 



) coeff [(e x - 1 -x) r ,x d ] d\, 



(B.6) 



where the first factor is the number of choices of v v-nodes so that their total degree is d, m d counts the total 
number of ways of connecting these v v-nodes in the graph ensemble, and (™) coeff [(e x — 1 — x) r ,x d ] dl counts 
the number of ways of connecting r c-nodes to the v v-nodes resulting in a stopping set. Since 
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J](l + yzJ>0'),yV* 

3=1 
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Q and coeff [(e x - 1 - x) r , x d ] < 1 



(B.7) 



we have, for m > en, r < [d/2\ , lv < d < Lv and w = d— lv the bound 
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(B.8) 



We now fix k > small enough so that (cik'/ 2 x ) V (V Lk) < |, which allows us to obtain 



E. 



Km (L-l)v [d/2] 



w/2 



v — 1 it;— r— 1 
oo 



v=l 



r n u — 1 oo 



w=0 



r V v-l 



v=l 



and by comparison with the geometric series the last sum converges to a finite positive constant. Thus the 
thesis of the lemma follows with k± — ne and n 2 = Cie 1 ^ 1 / 2 Y^Li v L / 2 2 1 ~ v - 

□ 
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